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Chapter  XIII 


Credibility  and  Subjective  Probabilities 

13.1  In  the  preceding  chapters  we  have  generally  assumed  that 

the  decision  maker  knew  the  probabilities  with  which  the  different  states 
of  the  world  would  occur.  In  practice  a  decision  maker  will  often  argue 
that  he  does  not  really  know  these  probabilities,  although  he  is  not  so 
"completely  ignorant"  that  he  feels  he  should  use  the  Laplace  principle 
of  insufficient  reason,  i.e.  assign  equal  probabilities  to  all  states  of 
the  world. 

In  this  Chapter  we  shall  study  the  ways  in  which  such  vague 
knowledge  or  beliefs  can  be  brought  to  bear  on  the  decision  problem.  The 
ideas  which  we  shall  develop  were  first  explicitly  foriulated  by  Savage  1 5 ]  • 

To  make  our  discussion  concrete,  we  shall  consider  an  insurance 
company,  which  holds  a  portfolio  of  insurance  contracts,  and  reserve  funds 
amounting  to  S.  Let  total  claims  payable  under  the  contracts  in  the 
portfolio  be  a  stochastic  variable  with  the  distribution  G(x),  We  shall 
ass  me  that  this  distribution  is  known  to  the  company. 

The  company  will  assign  the  following  utility  to  this  situation 

CO 

U(s)  =  J  u(s-x)  dG(x) 

0 

where  u(x)  is  the  utility  function  which  represents  the  company’s 
preference  ordering.  Since  we  have  not  brought  the  tine  element  into  the 
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model,  we  shall  assume  that  all  contracts  in  the  portfolio  expire  within 
a  fairly  short  period. 

13.2.  Let  us  now  assume  that  the  conpany  is  offered  an  additional 
contract  which  will  expire  within  the  sane  period. 

Let  P  =  the  premium  paid  for  this  contract,  and  let  the 
claim  distribution  b'» 

Z  with  probability  p 
0  with  probability  1  -  p  =  q 

The  company  will  accept  the  new  contract  if,  and  only  if  it 
leads  to  an  increase  in  utility,  i.e.  if 

PU(&tP-Z)  +  q  U(S+P)  >  U(S) 

In  some  contexts  it  nay  be  more  natural  to  assume  that  company 
is  invited  to  "quote  a  premium"  for  the  new  contract.  The  equation 

pU(8+P  -  Z)  +  qU(S+P)  =  U(S) 

will  then  determine  the  lowest  premium  P  which  the  company  can  quote. 

This  is  a  very  staple  application  of  the  principles,  which  we 
have  developed  in  earlier  chapters. 

However,  in  a  real  life  situation,  the  company  nay  not  feel  so 
certain  that  the  relevant  probability  is  exactly  p. 

Our  problem  is  to  find  out  what  this  actually  mao'  mean,  and  to 
study  how  the  insurance,  company  will,  or  should,  make  its  decision  in 


this  situation 
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13.3  Let  us  first  assume  that  the  insurance  compen}  maintains 

that  it  knows  absolutely  nothing  about  the  risk  covered  by  the  new 
contract.  If  this  statement  has  any  meaning  at  all,  it  must  Imply  that 
any  value  of  p  between  0  and  1  in  eouellv  possible  -  nr  enimlly 
probable. 

It  is  then  natural  to  write  the  equation  from  the  preceding 
paragraph  in  the  following  form 

p{  U( S+P-Z)  -  U(S+P)  }  +  U(S+P)  =  U(S) 

and  multiply  by  dp,  and  integrate  from  0  to  1.  This  will  give 

i  (  U(S+P-Z)  ♦  U(S+P)  }  =  U(S) 

as  the  equation,  which  determines  the  lowest  premium  P  which  the  company 
can  quote. 

13. ^  In  practice  we  will  not  often  have  to  make  decisions  under 

complete  ignorance.  We  will  usually  have  seme  information  or  prior  belief 
about  p.  The  mere  fact  that  somebody  wants  to  pay  for  this  insurance 
contract,  indicates  that  p  is  not  zero  -  i.e.  the  event  which  will  lead 
to  a  claim  payment  is  not  impossible. 

The  usual  procedure  nay  be  as  follows:  The  actuary  and  .he 
more  or  less  experienced  underwriters,  nay  agree  that  the  "best  estimate" 
is,  say  p  =  0.10,  adding  that  this  is  little  mere  than  an  educated  guess. 
When  pressed  for  more  precision,  they  may  state  that  p  is  very  likely 
to  be  somewhere  in  the  interval  (0.05,  0.20),  or  that  they  are  certain 
that  p  <  0. 40. 
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Arguments  of  this  kind  reflect  a  vague,  but  ver^  real  feeling 
of  uncertainty.  The  most  natural  way  of  giving  precision  to  this  statement 
seems  to  be  to  specify  the  weights  which  should  be  given  to  the  various 
possible  values  of  p.  We  can  do  this  by  specifying  a  function  f(p), 
which  takes  its  greatest  value  for  the  "best  estimate"  or  "most  likely" 
value  of  p. 

There  is  obviously  nothing  to  prevent  us  from  normalizing  the 
function,  and  requiring  that 

J  f(p)  dp  *  1 

0 

This  means  that  f(p)  can  be  interpreted  as  a  probability 
distribution,  which  represents  our  belief  about  the  value  of  p. 

So  far  this  makes  sense.  The  trouble  comes  if  or  when  we  state 
something  like 

f(0,l)  =  Pr  (  p  =  0.1  )  =  the  probability  that  the 
parameter  p  is  equal  to  0.1. 

This  statement  has  no  real  meaning.  A  parameter  is  not  a 
stochastic  variable,  so  it  has  no  meaning  to  assign  a  probability  other 
than  0  and  1  to  the  "event"  that  it  takes  a  particular  value. 

13.5  If  our  insurance  company  can  specify  the  function  which 

represents  its  prior  belief,  the  decision  problem  is  solved  by  multiplying 

the  basic  equation  by  f(p)  dp  and  integrating  from  0  to  1.  This  gives 

1 

U(3+F)  +  {  U(S+P-Z)  -  U(S+F)  j  J  pf(p)  dp  =  U(S) 

0 
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or  if  we  write 

1 

P  =  J  pf(p)  dp 
0 

p  U(S+P-Z)  +  (1  -  p)  U(5+P)  =  u(s) 

This  means  that  the  company  act3  as  if  it  was  certain  the  paraoeter  has 
the  value  p 

13.6.  The  exanple  we  have  discussed,  is  very  artificial,  but  it 

brings  out  the  essential  idea  involved. 

As  a  more  realistic  example,  let  us  assume  that  the  company  is 
offered  a  portfolio  of  n  contracts  of  the  type  considered  in  the  example. 

In  this  case  the  expected  claim  payment  is  npZ  =  y  and  the  amount  of 
premium  received  is  nP. 

The  "Principle  of  Equivalence"  which  is  the  foundation  of  classical 
insurance  theory,  requires  that  expected  payments  and  receipts  shall  be 
equal,  i.e.  that  the  premium  for  each  of  these  contracts  shall  be 

P  =  pZ 

This  will,  however,  not  be  acceptable  to  a  company  which  has  a 
"risk  aversion",  i.e,  a  company  which  is  worried  about  the  possibility  that 
actual  payments  may  exceed  the  expected  veuue. 

It  is  easy  to  see  that 

Pr  (y  =  kZ}  =  (k)  pk  (l  -  p)n  k 
Hence  the  minimum  acceptable  premium  is  determined  by 
U(S)  =  k|0  u  (3  ♦  nP  -  kZ)  (k)  pk(l-p)n'k 


This  formula  takes  into  account  the  uncertainty  which  in  statistical  language 
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is  due  tc  "sampling  fluctuations". 

The  uncertainty,  due  to  incomplete  knowledge  about  the  true 
\alue  of  the  parameter,  is  logically  of  a  different  nature.  If,  however, 
we  are  willing  to  specify  a  subjective  probability  distribution  f(p), 
which  represents  our  "prior  belief",  we  can  deal  with  this  Gecond  kind 
of  uncertainty  in  the  classical  way. 

13. T  In  general  the  problem  is  formulated  as  follows: 

Claim  payment  under  a  portfolio  of  insurance  contracts,  is  a 
stochastic  variable  with  a  distribution  G(x,a),  or  we  can  take  a  to 
be  a  vector  with  mean,  variance  and  other  parameters  of  the  distribution 
as  elements.  For  the  sake  of  simplicity,  we  shall  assume  that  the 
distribution  is  continuous,  and  that 

g(x,»)  =  ss&al 
dx 

If  the  company  has  to  quote  a  premium  for  this  portfolio,  it 
will  compute  the  expected  utility 

CO 

J  U(S+P-x)  g(x,cr)  dx 

0 

If  there  is  some  further  uncertainty  about  the  parameter  a, 
expressed  by  a  prior  distribution  f(o'),  we  have  to  carry  out  another 
integration 

00 

j  {  j  U(S+P-x)  g(x,a)  dx  }  f ( O')  da 
A  0 

where  A  is  the  domain  of  a. 
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However,  the  latter  Integral  is  obviously  equal  to 
00 

J  U(S+P-x)  {  J  g(x,  a)  f(<*)  d&  }  dx 
0  A 

rrhe  inner  integral 

h(x)  =  f  g(x,  or)  f(ot)  dor 

can  of  course  be  interpreted  as  a  probability  distribution. 

This  neans  that  the  lowest  acceptable  premium  P  is  determined  by 
00 

J  U(S+P-x)  h(x)  dx  =  U(S) 

0 

This  is  a  solution  of  the  saue  form  as  the  one  we  found  earlier.  The 
whole  elaborate  reasoning  about  uncertainty  over  the  value  of  the  parameters, 
means  only  that  we  replace  the  original  distribution  g(x,0f)  by  h(xy. 

13.8  The  procedure  of  the  preceding  paragraph  has  seme  practical  value 
only  if  we  know  -  or  have  good  reasons  to  believe  -  that  claim  payments 
really  are  generated  by  a  distribution  of  the  form  g(x,a). 

In  practice  we  do  not  often  know  this,  la  insurance  one  will  usually 
analyse  the  amounts  paid  as  claim  compensation  under  a  large  number  of 
identical  contracts.  Let  us,  for  instance,  assume  that  in  a  portfolio  of 
5000  automobile  collision  insurance  contracts  we  have  observed  380  claims, 
leading  to  a  total  payment  of  $  350,000, 
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Table  1 


Claim  Payment 

0 

$  0-100 
$  101-500 
$  501-1000 
$  1001-1500 
$  1501-2000 
$  2C01-2500 
$  2501-3000 
$  3000-3500 


Number  oi’  Claims 

4620 

0 

ICO 

105 

110 

25 

30 


Let  us  further  assume  that  claim  payments  can  be  broken  down  in  more 
detail  as  shown  in  Table  1. 

In  this  situation  we  can  ignore  the  detailed  breakdown,  and  Just  note 
that  the  average  claim  payment  per  contract  is  $70  and  on  this  basis 
formulate  our  beliefs  about  claim  payments  in  the  next  portfolio  which  our 
company  w?ll  underwrite. 

A  more  "sophisticated"  approach  may  be  to  fit  a  distribution  to  the 
data  of  the  table  -  for  instance  by  the  "method  of  moments"  -  and  indl  :ate 
the  reliability  of  the  estimated  parameter.  In  doing  so  we  may,  however, 
well  have  added  new  assumptions  rather  than  new  knowledge  to  the  model, 
and  it  is  quite  possible  that  the  simpler  approach  may  be  the  sounder  policy 
13.9  Let  us  now  return  to  our  simple  example,  in  which  the  claim 


payment  could  take  only  the  values  0  or  Z,  The  only  unknown  parameter 
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was  p  =  the  probability  that  the  claim  should  become  payable. 

If  we  really  have  no  idea  about  the  true  value  of  p,  there  may  be 
something  to  be  said  for  working  on  the  assumption  that  all  values  of  p 
should  be  given  the  same  weight.  Such  cases  are,  however,  rare,  so  let  us 
therefore  assume  that  we  have  some  relevant  knowledge,  namely  that  in  a 
portfolio  of  n  comparable  contracts,  there  were  k  claim  payments. 

To  a  statistician,  it  is  then  natural  to  suggest  that  we  act  as  if 


He  will  usually  be  able  to  Justify  this  in  several  different  ways. 

There  is,  however,  some  uncertainty  about  this  estimate,  particularly  if 
n  is  email,  and  we  want  to  allow  for  this  in  our  decision. 

Let  us  therefore  write: 

Pr(k|p)  =  (  k  )  pk(l-p)n’k 

This  is  the  probability  of  the  observed  result  k,  if  the  true  probability 
is  p.  This  is  usually  referred  as  the  likelihood  of  the  observed  result. 

One  justification  for  taking  p  =  k/n  is  that  this  value  will  maximize  the 
likelihood. 

13.10  From  the  theory  of  conditional  probability  we  know  that 

Pr(k|p)  pr(p)  *  Pr(p| k)  pr(k) 
or 

Pr(p)k)  =  Mil 

Pr(k) 

Here  the  denominator  car:  be  interpreted  as  the  absolute  probability 
ii  k  -  i.e.  the  probability  of  observing  k-claims,  regardless  of  what  the 
true  probability  p  may  be. 
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We  then  have  in  a  purely  formal  way 

Pr(k)  =  L  Pr(k|p)  Pr(p) 

Here  the  sum  is  over  all  values  01'  p,  and  Pr(p)  is  the  weight  of  our 
belief  that  the  true  value  of  the  parameter  is  p.  To  express  this  result 
in  the  notation  we  have  used  earlier,  we  shall  write 

Pr(p)  =  f(p) 

Hence  our  formula  can  be  written 

Pr(pilc)  = 

f  (k)  pk(l-p)n’k  f(p)  dp 

v’ 

0 

This  is  a  special  case  01  the  classical  Bayes1  formula. 

The  formula  gives  us  the  "likelihood"  that  p  is  the  true  parameter, 
given  that  k  w as  observed.  The  formula  depends  on  the  "prior  belief", 
represented  by  the  density  function  f(p). 

Pr(plk)  can  therefore  be  taken  to  be  a  distribution,  representing 
our  belief  about  p,  when  we  combine  the  statistical  experience  and  our 
prior  belief. 

13.11  If  we  know  nothing  about  p,  except  that  k  claims  occurred  in 

a  sample  of  n,  it  may  seem  natural  to  assume  f(p)  =  1.  This  will  reduce 
our  formula  to 

Pr  ( p  I  k )  = 

J’  Pk(l-p)n"k  dp 
0 


The  denominator  in  this  expression  is  the  so-called  Beta-function 


r 
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J  P^(l-p)n  ^  dp  =  B(k+1,  n-k+ 1) 
0 


ki  (n-k)! 
( n+ 1 )  l 


We  can  nov  apply  this  result  to  our  original  problem.  We  started  in 
lera  13.3  with  the  equation 

U(S+P)  -  p  {  U(S+P)  -  u(s+r-z)  J  =  U(S) 

We  then  multiplied  by  f(p)  dp  and  integrated  over  p  from  0  to  1, 
and  obtained 

U(3+P)  -  p  (U(S+P)  -  U(S+P-Z)  }  =  U(5) 


where 


1 

P  =  J  P  f(p)  dp 
0 


In  our  present  example  we  have  to  replace  f(p)  by 


Pr(p|k)  =  - - -  Pk(l-P)n  K 

B(k+1,  n-kU) 


Substituting  this,  we  obtain 

k  +  1 

p  =  - 

n  +  2 

This  appears  as  the  probability  which  we  should  use  in  our  decision, 
if  re  want  to  combine  the  experience  obtained  by  observing  a  comparable 
portfolio,  and  our  prior  belief  that  every  value  of  p  was  equally  likely. 

We  see  that  for  k  =  0,  we  should  take  p  =  •  Th*3  ffiean3  that 

unless  n  is  very  large,  our  prior  belief  will  still  carry  seme  weight.  The 
fact  that  no  clrim  occurred  in  a  sample  of  n  insurance  contracts,  does  net 
lead  us  to  make  lUture  decisions  on  the  assumption  that  p  =  0. 


< 
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lj.12  So  far  we  have  treated  trior  belle!  as  a  very  vaT^e  concept  - 
even  valuer  than  the  utility  concept.  Our  only  concrete  result  was  that  ii 
prior  teller  shall  make  any  sense,  it  must  be  possible  to  represent  it  as  a 
prooability  distribution  -  or  a  weight  function  -  over  the  set  of  values 
which  can  be  taken  by  some  parameter. 

If  we  are  absolutely  certain  that  this  parameter  has  the  value  p  , 
the  distribution  will  be  as  in  Fig.  1. 

xr  -ip) 

Flo.  1  I  T 


1° 

II'  vc  think  any  value  between 
will  be  as  in  Fig.  2.  '[\ 


- 1 - > 

P0  ^  P 
0  and  1  as  equally  likely, 
f(p) 


the  distribution 


Fig.  2 


1 


In  the  intermediary  cases  our  prior  belief  will  be  represented  by  a 
function  as  illustrated  by  Fig.  3. 

f(p) 


Fig.  3 


13.13  From  these  considerations  it  follows  that  we  are  interested  only  in 
the  general  shape  of  the  prior  distribution.  We  want  a  distribution  function, 
which  can  represent  prior  beliefs  with  sufficient  approximation  -  or  with  as 
much  precision  as  the  decision  maker  can  express. 


We  ore  therefore  led  to  examine  if  this  class  ic  sufficiently  rich 
to  represent  all  the  prior  beliefs,  which  we  may  want  to  study. 

For  the  mean  of  the  distribution  we  find: 


J 


0 


P  f(p)  dp  = 


a  +  1 
a+b+2 


and  for  the  variance: 


a 


r\ 

d 


1 

J  (P-^)2  i(p)  dp  = 


(g»l)  (b+l) 

rs 


0  (a+b+2)‘"  (a+b+3) 

From  these  expressions  ve  see  that  if  p,  and  0  are  siven,  we  can 
usually  determine  a  and  b.  This  means  that  if  we  describe  our  beliefs 
by  specifying  only  the  two  first  moments  of  the  prior  distribution,  we 
can  always  i'ina  a  Beta-distribution  which  meets  our  specifications. 

13.15  Let  us  now  assume  that  our  prior  belief  can  be  represented  by  the 
distribution: 


f(p)  =  - 1 -  pa(l-p)b 

B(a+1,  b+l) 


Let  us  assume  that  when  we  in  this  way  have  made  up  our  mind  os  to  what 
we  believe  about  the  claim  frequency,  we  learn  that  in  a  comparable  portfolio 
of  n  contracts  there  were  k  claims.  To  make  use  of  this  new  knowledge, 
we  apply  the  formula  in  para  13.10  and  find: 


Fr(p|k)  = 


k+a/,  xn-k+b 

£ _ U-p) 


J  p^U-p) 


n-k+b 


k*-a/,  n-k+l\ 

=  £  U-JP  _ L_ 

B  ( k+ a+ 1,  n-k+b+ 1 ) 


0 


This  3ives  the  probability  to  be  used  for  our  decision: 


1 

P  =  J  P  Lr(p|k)  dp  = 
0 


k+  a+ 1 


n+a+b+1 
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If  an  jrior  belie!',  we  reel  certain  that  p  =  p^,  we  must  have: 


P 


0 


at  1 
a+b+2 


>  0 


arid 


2  _  (a+1)  (b+l)  _  , 

a  ~  (a+b+2)  -  (r+b+3) 


It  is  obvious  that  in  this  case  booh  a  arid  b  must  be  infinite. 
From  the  first  condition  we  obtain 


S  +  i 


-  +  i  +  - 

b  b 


or 


lim 


a 

b 


1-P, 


0 


From  the  expression  for  p  we  find 


P  = 


-  +  Q  +  i 

b  b  b 


-  +  -  +  1+  - 

b  b  b 


Going  to  the  limit,  we  find 


1  -  P, 


P  = 


0 


:0 


1-P, 


+  1 


0 


=  p, 


0 


This  expresses  the  obvious.  If  we  are  certain  that  p  =  p^,  we 
will  make  our  decision  accordingly,  no  matter  what  experimental  evidence 
should  become  available. 


13. 16  Ii'  in  this  example  our  prior  belief  can  be  represented  by 
a  Beta-distribution 

^  =  B(a<-1.  b+1)  p  ^1_p) 

can  give  this  a  simple  intuitive  meaning: 

Our  beliei’s  about  the  parameter  p  are  as  if  we  had  observed  that 
a  claims  were  made  in  a  portfolio  of  a+b  insurance  contracts. 

If  we  actually  had  made  this  observation,  we  would  have  some  knowledge 
or  belief  about  p.  Our  problem  is  then  to  express  formally  what  this 
knowledge  really  is.  The  natural  way  of  doing  this  -  at  least  to  a 
statistician  -  i3  to  say  that  our  knowledge  is  represented  by  a  Beta  - 
distribution  over  the  domain  of  the  possible  values  of  the  parameter. 

However  we  really  ward  to  carry  the  argument  through  in  the  opposite 
direction.  We  want  to  start  with  the  prior  beliefs  which  we  have  about  the 
parameter  p  and  give  a  precise  description  of  these  beliefs*  We  can  do 
this  by  speciiying  a  "prior  distribution",  but  it  would  be  more  attractive 
if  we  could  describe  an  experiment  and  a  particular  outcome  of  the 
experiment  which  in  some  sense  represents  our  beliefs  about  the  parameter. 

13.17  The  problem  we  outlined  in  the  preceding  paragraph  has  been 
discussed  for  more  than  50  years  -  often  In  an  obscure  language  -  by  American 
actuaries  under  the  name  oi  credibility  theory.  This  theory  was  founded 

by  Whitney  [6]  and  has  been  developed  by  Perryman  (3]>  Bailey  [l],  Carlson  l 2], 
and  others,  without  much  contact  with  the  mainstream  of  statistical  theory. 

To  illustrate  the  application  of  the  theory,  let  us  consider  an  insurance 
company  which  has  to  quote  a  premium  for  an  insurance  contract  of  the  simple 
type  considered  in  our  previous  examples.  Let  us  assume  that  the  company 
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when  making  this  decision  con  draw  cn  two  types  oi  information: 

(i)  Statistical  information  about  comparable  contracts  -  for 
instance  that  there  were  k  claims  in  a  portfolio  ci  n  identical 

similar  contracts. 

(ii)  Other  relevant  information  -  for  instance  statistical  observations 
01'  jcrtfolios  of  contracts  which  are  not  quite  comparable  to  the 
contract  in  question. 

If  there  is  sufficient  statistical  information,  i.e.  if  n  is 
large,  the  company  will  not  consider  the  other  information.  The  company 
will  act  eui  if  it  was  certain  that  p  =  k/n.  In  this  case  the  actuaries 
will  say  that  the  statistical  experience  carries  "100  per  cent  credibility", 
and  they  will  usually  be  embarrassed  if  they  are  asked  to  justify  this 
statement. 

When  the  statistical  experience  is  insufficient,  the  company  may  use 
other  relevant  information.  However,  this  is  not  possible  unless  different 
pieces  of  information  con  be  made  ccmensurabie.  This  leads  us  to  determine 
the  statistical  evidence  which  is  equivalent  to  the  ether  relevant  infor¬ 
mation  which  we  want  to  use,  and  this  is  Just  what  we  did  in  para  13.15* 

Theoretically  or.  insurance  company  should  bring  into  consideration 
additional  information  until  the  equivalent  statistical  experience  carries 
ICO  per  cent  credibility.  Hew  this  should  be  done  is  a  difficult  problem, 
which  is  lar  from  beirg  satisfactorily  solved  in  the  existing  theory.  The 
statistical  experience  of  an  insurance  company  which  has  written  1  million 
iire-insurar.ee  contracts  in  Hew  York  State  obviously  contains  information 
which  may  be  of  value  to  a  company  which  writes  fire  insurance  in  California. 


Intuitively  we  may  feel  that  this  information  is  of  less  value  than  the 
information  we  could  have  obtained  from  the  statistical  experience  01  a 
similar  company  operating  in  California.  It  is,  however,  not  easy  to 
give  a  precise  formulation  to  such  feelings.  Will  1  million  observa¬ 
tions  iron  New  York  be  equivalent  to  800,000  observations  it  cm  California"' 

If  an  insurance  company  operates  with  a  system  of  premium  rates 
derived  from  statistical  experience  which  carries  ICO  per  cent  credibility 
good  or  bad  underwriting  r^ults  will  be  explained  as  caused  by  random 
fluctuations.  These  results  will  not  induce  the  company  to  change  its 
premium  rates . 

In  practice  an  insurance  company  will  not  usually  assign  ICC  per  cent 
credibility  to  the  statistical  experience  which  constitutes  the  foundation 
of  its  premium  rates.  The  most  obvious  reason  for  this  cautious  attitude 
is  that  the  basic  probabilities  may  change  with  time. 

In  this  situation  the  company  will  accumulate  new  statistical 
experience  as  time  goes  by,  and  this  new  information  may  lead  the  company 
to  adjust  its  premium  rates.  Hew  much  the  rates  should  be  changed  will 
depend  on  the  credibility  carried  by  the  initial  statistical  experience. 
This  question  can  be  the  subject  of  heated  discussions  between  company 
representatives  and  State  Insurance  Commissioners. 

13.18  The  Beta-distribution  is  not  always  a  convenient  representation 
of  our  "prior  belief".  Let  us,  for  Instance,  assume  that  we  are  ccnsiderin 
on  investment,  where  we  know  that  the  return  is  normally  distributed  with 
unit  variance  and  mean  m,  i.e.  in  our  notation  we  have: 


-19- 


If  our  prior  belief  about  m  can  be  represented  by  the  distribution 
f(m),  we  should  then  compute 

g(x)  =  J  e  ’^X’m^f(m)  dm 
a 

and  use  this  distribution  g(x)  to  compute  the  expected  utility;  which 
will  be  our  decision  criterion. 

If  we  in  this  integral  take  f(m)  as  a  Beta-distribution  -  adjusted 
so  that  it  applies  to  the  interval  (a,b),  we  will  get  into  some  very  messy 
computations,  and  we  will  have  to  work  with  an  extremely  inconvenient 
function  g(x). 


The  natural  conjugate  distribution  in  this  case  is 


['(m)  = 


This  will  give 


the  30-c ailed  posterior  distribution  -  is  normal.  We  may  note  that  the 
variance  of  this  distribution  Is  the  sum  of  the  variances  from  the  two 


distributions  we  started  with.  The  choice  of  the  appropriate  conjugate 
distribution  has  been  discussed  in  great  detail  by  Raiffa  and  Schlaifer  [4]. 
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13.19  If  we  draw  a  sample  01  n  l'rom  a  Normal  population  with  unit 
variance,  the  stochastic  variable 

*  ‘I  (xi+  x2  +  -  Xn) 
will  be  Normally  distributed  with  variance  —  . 

If  the  population  meen  is  p  the  probability  that  the  average  of 

our  aample  shall  be  equal  to  x  ,  is  proportional  to 

n  /  -  s  2 
~2  U  ‘  V-) 
e 

The  best  estimate  of  the  population  mean  is  then  the  value  of  P, 
which  maximizes  this  likelihood  function,  l.e.  p  =  x  . 

This  means  that  if  we  represent  our  prior  belief  about  m  by  the 
distribution 

_ l _  e  “  £S3±li. 

a/2^2  ^ 

we  feel  as  confident  about  m  =  p  as  we  would  be  if  p  was  the  observed 
average  of  a  sample  of  n  =  l/a 

Ii  in  this  example  our  prior  belief  cannot  be  represented  by  a  Normal 
distribution  -  for  instance  because  our  beliefs  are  not  symmetrical  round 
some  central  value  p  ,  we  will  have  to  go  through,  more  cumbersome 
mathematics.  We  will  also  have  to  give  up  the  intuitively  attractive  idea 
of  stating  that  our  beliei's  are  equivalent  to  the  beliefs  we  would  have 
if  a  specific  experiment  had  given  a  particular  outcome. 

13.20  Let  us  now  return  to  the  results  of  para  13.7.  We  found  that 
our  final  decision  would  be  based  on  the  expected  utility 

CO 

J  {  J  u(S  +  P  -  x)  g(x,ff)  i’(a)  da  )  dx 
0  A 
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In  this  expression: 

(i)  The  prior  distribution  f (ot)  represents  what  we  believe 

(ii)  The  utility  function  u(x)  represents  what  we  want 
(lii)  The  distribution  g(x,a)  represents  what  we  know. 

All  these  three  elements  must  be  considered  in  a  rational  decision, 
and  in  an  analysis  of  the  problem  they  should  be  separated. 

In  practice  it  may,  however,  be  difficult  to  separate  what  we  believe 
from  what  we  know.  In  our  simple  example  the  place  of  the  distribution 
g(x,or)  was  taken  by  the  binomial 

(5)  Pkd-P)  n'k 

where  p  is  the  unknown  parameter.  This  binomial  distribution  rests 
on  the  assumptions: 

(i)  The  probability  of  a  claim  in  the  same  under  all  the  n  contracts 

(ii)  The  probability  of  a  claim  under  an  arbitrary  contract  is  indepen¬ 

dent  of  whether  claims  have  been  made  under  any  of  the  n-1  other 
contracts 

If  we  know  that  these  assumptions  are  true,  there  is  no  problem.  If, 
however,  uhese  assumptions  Just  represent  our  beliefs,  or  are  accepted  as 
working  hypotheses,  they  should  be  included  in  f(or)  and  not  in  g(x,a). 
This  means  that  the  separation  of  the  different  elements,  which  seems 
essential  to  a  rational  analysis  of  the  decision  problem,  is  by  its  very 
nature  arbitrary.  This  again  means  that  in  a  preliminary  study  only  the 
general  shape  of  the  functions  f,  g,  and  u  is  significant,  and  that 
we  should  feel  free  to  choose  functions  which  are  easy  to  manipulate 


mathemat  ic  ally . 
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